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Abstract: We present a method to resolve combinatorial issues in multi-particle final states 
at hadron colliders. The use of kinematic variables such as Mt2 and invariant mass signifi- 
cantly reduces combinatorial ambiguities in the signal, but at a cost of losing statistics. We 
illustrate this idea with gluino pair production leading to 4 jets + in the final state as well 
as tt production in the dilepton channel. Compared to results in recent studies, our method 
provides greater efficiency with similar purity 
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1. Introduction 

The Tevatron Run II at Fermilab and the recent turn-on of the Large Hadron Collider (LHC) 
at CERN are beginning to explore the physics of the Terascale. There are sound theoretical 
reasons to believe that new physics beyond the Standard Model (BSM) is going to be revealed 
in those experiments. Perhaps the most compelling phenomenological evidence for BSM 
particles and interactions at the TeV scale is provided by the dark matter problem. It is 
a tantalizing coincidence that a neutral, weakly interacting massive particle (WIMP) in the 
TeV range can explain all of the observed dark matter in the Universe. A typical WIMP 
does not interact in the detector and can only manifest itself as missing energy. The WIMP 
idea therefore greatly motivates the study of missing energy signatures at the Tevatron and 
the LHC. Recently, new ideas of various kinematic methods for the determination of masses, 
spins, and couplings have attracted a lot of interest in missing energy signatures at colliders 
(see [1-3] for recent reviews). They not only offer methods for the determination of masses 
and spins but also provide good background rejection and enhance discovery potential. 

Unlike lepton colliders, the environment at hadron colliders is much more complex and 
often make such tasks difficult. One issue more prevalent in hadron colliders is the combi- 
natorial ambiguities in events, especially in events with jets. In general, the event topology 
at hadron colliders will typically contain a number of jets. Some come from the decays of 
heavier particles in the signal and others may originate from initial state radiation (ISR). 
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The problem gets worse due to detector limitations such as finite resolution, inability of dis- 
tinguishing quarks from anti-quarks, etc. All these jets pose a severe combinatorics problem: 
which one of the many jets in an event is the correct one to assign to a particular decay? 

Some of the existing studies in the literature simply take for granted that the correct jet 
can be somehow identified. Others select the jet by matching to the true quark jet in the 
event generator output, which of course is unobservable. The severity of the jet combinatorics 
problem is rather model dependent and in practice how well it can be dealt with depends 
on the individual case at hand. For example, if the mass spectrum is relatively broad, one 
might expect a jet from the decay of a heavier particle to be among the hardest in the event. 
This information can be used to improve the purity of the sample. Fortunately, there exists 
a method (the mixed event technique) which should, at least in principle, remove the effects 
from incorrect jet combinations [4]. This method has been successfully applied to measuring 
SUSY masses at the SPSla study point [5] (see [6] for a more recent study). However, this 
only works statistically and does not discriminate on an event-by-event basis. The existence 
of invisible particles adds difficulty in resolving combinatorial issues since it prohibits us from 
reconstructing the whole signal. 

Combinatorial issues appear at all different levels of an event (i.e. from ISR, decays 
of heavier particles, etc.). A general solution to all these problems would be very difficult. 
We may be able to achieve this statistically by relying on a likelihood analysis or matrix 
element methods [7]. However, this task requires knowledge of new physics which may not be 
available at the time of discovery. Therefore, it is desirable to find a solution that is model- 
independent. In fact, some studies on kinematic variables in the literature already indicate 
that these variables may be useful for mitigating combinatorial difficulties associated with 
ISR and the signal jets [8,9] ^. 

The ISR (rather than always proving to be an obstacle) can actually be helpful in the 
study of BSM physics. This has already been demonstrated in several recent studies. In 
Ref. [15], ISR was shown to make BSM events more prominent by giving pair-produced new- 
physics states something to recoil against, thus increasing both fix and Ht- For the Mt2 
variable (defined in Section 3.1 below) [16, 17], it is known that the transverse momentum 
of ISR makes the kink structure more pronounced (see references in [1]). Usually ISR over- 
estimates the expected end point, which changes depending on the size of the transverse 
momentum. To make better measurements one needs to understand systematically the effect 
of ISR as a function of transverse momentum [18-21]. 

In this paper, we concentrate on combinatorial ambiguities among the particles in the 
signal. It is important to resolve or mitigate this combinatorial background issue. Solutions 
can be directly used to improve experimental measurements such as the forward-backward 
asymmetry in the top quark pair production, the H^-hclicity, top mass measurements, etc. 
In terms of BSM, they obviously become more important since most BSM models predict a 
WIMP candidate which, when produced, will show up as missing transverse momentum in 
the detector. An appropriate understanding of combinatorics may even be able to reduce SM 
backgrounds. 

'^See [7,10-14] for different approaches. 
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In resolving combinatorial ambiguities in signal events, recently Ref. [22] argued that the 
proposed pT versus invariant mass method performs better than the well known hemisphere 
method [23]. This was shown for parton level events without detector effects, backgrounds or 
ISR. The results in the previous study are quoted in terms of the efficiency and purity of the 
samples. For the example point taken in [22] (the 3 body decay of 600 GeV gluino (g) into 
2 jets and a neutralino (x?) of 100 GeV), the authors have obtained 91.7% purity and 3.1% 
efficiency for the diquark system with an invariant mass less than 500 GeV and a transverse 
momentum greater than 450 GeV. 

The purpose of this paper to revisit the proposed method in [22] and improve the obtained 
efficiency and purity by utilizing the M'T2 variable in addition to invariant mass. Section |2| 
begins with a review of the method described in [22]. In Section ^ we propose our new method 
for resolving combinatorial ambiguities in the context of gluino pair production with 4 jets 
+ in the final state. Following the spirit of Ref. [22] we analyze our method under similar 
assumptions in order to make a suitable comparison. Then we determine the effectiveness 
of this method by considering different mass splittings, possible uncertainties in the gluino 
and neutralino masses, and ISR. Next, in Section we apply the same method to the ti 
dilepton system to resolve the two-fold ambiguity in pairing a b-tagged jet (6) and a lepton 
{£). Section |5| is reserved for our conclusions. 

2. The pt versus M method 

In this section we briefly review the px versus M method in Ref. [22]. For the discussion on 
combinatorial ambiguities at hadron colliders, the authors consider gluino pair production 
with each gluino decaying to the lightest neutralino and two quarks: gg — )■ qqqqXiXi- All 
quarks are treated at parton level and the effects of ISR and parton showering are not 
considered. Following the example used in [22], we consider the off-shell squark case {niq > 
rUg) with a gluino mass nig = 600 GeV and a neutralino mass m^o = 100 GeV (the on-shell 
case is similar). 

Without any ISR, we are left with 4 jets in the final state. In order to do any proper 
analysis, it needs to be determined which of the 3 possible pairings of jets is the correct 
combination. In Ref. [22], the authors explored the possibility of extracting the correct jet 
combination using the invariant mass and transverse momentum of each diquark system. 
They noticed two prominent features. First, they noted the kinematic edge in the diquark 
invariant mass distribution, which is especially visible for combinations with high pT- The 
excess of diquark combinations with an invariant mass larger than the kinematic edge value 
{rrig — m^o = 500 GeV) must all be incorrect combinations. Second, those combinations which 
have invariant mass larger than the kinematic edge (which are all incorrect combinations) tail 
off quickly towards larger diquark pT- Events below the edge (which are a mixture of correct 
and incorrect combinations) extend to higher px- Therefore, for the events with larger pT, 
the ratio of correct to incorrect combinations increases. 

To reproduce the results of Ref. [22] we use the MadGraph/MadEvent package [24] to 
generate a total of lOK events at the LHC with a center of mass energy of 7 TeV. Simulating 
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Figure 1: The invariant mass (a) and pr distributions (b) of tlic jet combinations in the final state 
of tlie gluino pair production, (c) Diquark transverse momentum versus invariant mass for the correct 
(in black) and incorrect (in yellow) jet combinations. The vertical and horizontal lines represent the 
cuts used to determine the correct jet combination. These distribution show the motivation for the 
cuts proposed in [22]. 



the effects caused by detector resolution in the hadronic calorimeter, the 4-momenta of the 
final state jets are smeared via 



E J E 

where a = 0.5 (0.1) VGeV and b = 0.03 (0.007) for jets (leptons) [23,25,26]. 

Fig. H shows (a) the pT and (b) invariant mass distributions for the correct (blue, solid), 
incorrect (red, dashed) and all (black, dot-dashed) combinations of jets. The different nor- 
malizations of the three histograms are understood since there are two incorrect pairs and 
one correct pair in each event. In the invariant mass distribution of the correct jet combina- 
tion there is a clear cutoff at rrig — m-o = 500 GeV. A small fraction of events beyond this 
kinematic edge occur due to detector resolution. In the px distribution at high value {pT ^ 
500 GeV) the ratio of correct to incorrect jet combinations becomes > 1. The pT cut is cho- 
sen such that only 5% of the combinations with an invariant mass above the kinematic edge 
(which are likely to be incorrect combinations) survive. As suggested, a cut on combinations 
with the invariant mass below the kinematic edge and above the pT cut can guarantee an 
event sample dominated by correct combinations. To extract the correct jet combination the 
following cuts are made: 

• The invariant mass of the diquark pair is less than 500 GeV. 

• The pt of the diquark pair is greater than 450 GeV. 

If only one combination of jets passes these cuts then the event is accepted, otherwise the 
event is discarded. Fig. |l|(c) shows the diquark transverse momentum versus invariant mass 
of the correct (black) and incorrect (yellow) jet combinations. The vertical and horizontal 
lines represent expected cuts to reduce the number of incorrect combinations. Counting those 
events that pass two cuts (i.e., the number of events in the left-upper corner of Fig. ||(c)), 
we find an event efficiency of 3.1% with an event purity of 91.7%. This is consistent with 
the results in Ref. [22]. Furthermore, a higher efficiency can be achieved at the cost of lower 
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purity by imposing a lower pT cut. It is also shown in [22] that this method performs better 
than the hemisphere method. 



3. Improved method 

We note that the method discussed in the previous section has a low efficiency while the 
corresponding purity is relatively high. This is due to the long tail in the pT distribution of 
the diquark system. In general, for a large mass splitting between the mother and daughter 
particles, the pT of the resulting diquark system is relatively large. Conversely, it is smaller 
for a narrower spectrum, i.e., the peak position in the pT distribution is shifted to a lower 
value in a narrower mass spectrum. Therefore there is a correlation between the location 
of the peak in the distribution and the mass difference. However the amount of shift (or 
the location of the peak) is not well enough predicted quantitatively for this observation 
to be useful, since the diquark system inherits some kinematic information of the mother 
particles at the production level. An ISR jet is also problematic since the mother particle 
system will be boosted in the opposite direction of the ISR resulting in modification of the 
Pt spectra. Unlike the invariant mass distribution, the pT distribution does not exhibit any 
kinematic edge. The distribution of the background (incorrect combinations) has a similar 
shape and, as argued, it decreases faster than the distribution of the correct combination 
above a certain value (Fig. |l|(b)). On the other hand, the invariant mass distribution shows 
a clear kinematic edge in the distribution of the correct combination but not in that of the 
incorrect combination, making the two distributions distinct (Fig. |l|(a)). 

One can recall a similar situation in the leptonic decay of the W boson at hadron colliders. 
The mass of the W boson in principle can be measured from the pT distribution of a decay 
product (a lepton). It shows a nice kinematic edge at half of the W mass when the W 
boson is produced at rest. However, in reality the transverse mass is used instead. It shows 
a kinematic edge at the mass of the W and the value of the end point is stable under the 
presence of initial state radiation. What variable would do such a job for the example in 
the previous section while increasing the event efficiency? In this paper, instead of cutting 
on Pt, we propose to use Mt2 as the second cut in addition to the invariant mass. It is the 
most natural extension of the transverse mass in the final state when more than one missing 
particle is present. As advertised, it shows a clear kinematic edge and is stable {i.e., keeping 
the end point at the same value) under a boost of the whole system. One should investigate 
how well it performs in terms of the event efficiency and purity. 

We consider the same process (4 jets and 2 neutralinos) as in the previous section without 
any ISR and discuss a method to improve the event efficiency and purity. Then, we discuss 
how our results change with different mass spectra and the effect of uncertainty in mass 
measurements. Finally, we look at the same process with ISR and discuss how our results 
change if we are correctly able to identify the ISR jet. First, we begin this section with a 
review of Mt2- 
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3.1 Brief review of Mt2 

The traditional Mt2 variable is defined as follows [16,17]. Consider a pair production of a 
mother particle, P, and their decays down to a daughter particle of mass M^. For any given 
event, one can construct the transverse mass, Mit, of each parent: 

Mf^ = m2 + Mj + 2{E,TEfT - p,T ■ pfr) , (3-1) 

where 

EiT = yjmf + |p>|2, EfT^^Juj^^^ (3.2) 

is the transverse energy of the visible particles, 1^, and daughter particle, d, in the decay 
branch of each mother particle, correspondingly. piT and rrij are the transverse momentum 
and invariant mass of the visible particles in branch i {i = 1,2). The individual momenta, pfj,, 
of the missing daughter particles, d, are unknown, but they are constrained by the measured 
missing transverse momentum, ^7^, in the event: 

pfx + Pir =fT = -Pt - PIT - P2T- (3.3) 

Here Pt is the Upstream Transverse Momentum (UTM), which is the transverse momentum 
of all other visible particles that are not considered in the decay chains. For the true values 
of the missing momenta, pfj., each transverse mass in Eq. 3.1 is bounded above by the true 
parent mass, Mp. 

This fact can be used in a rather ingenious way to define the Cambridge Mt2 variable 
[16,17]. One takes the larger of the two quantities in Eq. |3.l| and minimizes it over all possible 
partitions of the unknown daughter momenta, pfj,, subject to the constraint in Eq. 3.3 : 

MT2{Md,PT,PiT)= min {max {Afir, Mst}} . (3.4) 

Since the masses of the mother and daughter particles are not known in advance, one should 
treat Mt2 as a function of the trial mass (M^) of the daughter particle. For a given Pt, the 
endpoint M^2^ of this distribution gives the parent mass, Mp, as a function of the input 
trial daughter mass, M^. An important property of Mt2 is 

Mp{Md = Md) = M^^2"" (Md = Md) = Mp . (3.5) 

Then one is able to obtain a one dimensional relation between two masses. In cases with 
non-zero UTM and composite visible system, a kink structure appears, allowing simultaneous 
determination of both masses (see [1] and references therein). 

In the case of Pt = 0, the analytic expression for Mt2 can be obtained, while for Pt 7^ 
case there is no analytic solution to minimization and one has to rely on a numerical code for 
computation. A clever trick to get around this has been proposed in Ref. [20]. The idea is 
to project further the transverse quantities with respect to the non-zero UTM, in which case 
the momentum conservation in the direction perpendicular to the UTM is independent of the 
UTM (by construction) and one is able to use an analytic expression for the computation of 
projected Mt2- Mt2 has been extended to cases with non-identical mothers [27] and non- 
identical daughters [28]. In principle, one can use it for an associated production as well as 
processes with two different daughter particles. 
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Figure 2: (a) The Mt2 distribution of the correct and incorrect combinations of jets in the gluino 
pair production. For the correct jet combination a clear cutoff appears at the gluino mass, (b) The 
Mt2 versus invariant mass for the correct (in black) and incorrect (in yellow) jet combinations. The 
vertical and horizontal lines represent the cuts used to determine the correct jet combination. 



3.2 Case without ISR 

As an alternative method, instead of using transverse momentum, one uses Mt2 to help 
determine the correct jet combination. Fig. ^(a) shows the distribution for the correct 
(blue, solid), incorrect (red, dashed) and all (black, dot-dashed) combinations of jets. Note 
that the normalization of the distribution of incorrect (all) pairs is tv^^ice (three times) higher 
than that of the correct distribution. The distribution for the correct jet combination has a 
clear cutoff at the gluino mass {rrig = 600 GeV). We apply the same cut that was used in the 
Pt versus M method for the invariant mass of the jet combinations but now we also require 
require Mj'2 < 600 GeV. As before, we keep the events where only one combination of jets 
passes both cuts. Fig. ^(b) shows the correct (black) and incorrect combinations (yellow) in 
Mt2 versus invariant mass. For events where only one combination of jets pass the two cuts, 
we obtain an event efficiency of 20% with an event purity of 95% as shown in Table |l|. Thus 
we have a concrete example showing that an appropriate choice of kinematic variable cuts 
enhances both the efficiency and purity of our signal. 

Unlike the cuts using pT, a very small fraction of events with correct combinations are 
ruled out completely by our cuts {i.e., many events have 2 or 3 combinations that pass 
the M'T2 and invariant mass cuts). Table |l] shows the number of events passing our cuts 
broken down by the number of combinations that pass. With these cuts only 2% of the 
correct combinations are discarded and 98% of correct combinations survive, so it is possible 
to use more refined cuts to extract the correct combination from the events with 2 and 
3 combinations which could potentially increase the event efficiency even above 20%. In 
principle, one can apply a px cut at this point. However, as argued, pT is dependent on the 
mass spectrum and ISR, and perhaps it should be the last variable to make a cut on. Instead, 
we further increase the efficiency and purity of this procedure by simply taking a look at the 
events where only two combinations pass the cuts so far. 
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Number of passed 
combinations 


Number of events 


Number of events 
with correct combination 





70 




1 


2003 


1896 


2 


3135 


3076 


3 


4792 


4792 



Table 1: The breakdown of the number of events that have — 3 combinations that pass the Mt2 
and invariant mass cuts. The events with one correct combination give us a 20% (2003/10000) 
event efficiency with an event purity of 95% (1896/2003). With these cuts only 2% of the correct 
combinations are discarded so it is possible to use more refined cuts to extract the correct combination 
from the events with 2 and 3 combinations. 

Making further cuts allows us to extract the correct combination from the two that are 
left over. We discuss two such ways of extracting the correct combination using the invariant 
mass and M'T2 distributions. In Figs. ||(a) and ^a) we can see that for invariant masses 
and Mt2 that are below the kinematic cutoff the correct combination tends to be closer to 
the cutoff than the incorrect combination. We can take advantage of this by introducing a 
second cutoff. In the case of the Mt2 distribution we require that one combination have an 
Mt2 above this cutoff and the other is below. We then take the combination that is above 
this cutoff as the correct one. To select the cutoff we maximize the sensitivity eD^, where the 
e is the efficiency, D = 2P — 1 is the dilution, and P is the purity. This is a commonly used 
method for optimization in such situations [29]. Fig. |3|(a) shows that the maximum sensitivity 
occurs at an Mt2 cutoff at 460 GeV which would give us an overall combined efficiency of 
33% and purity of 84%. In the case of the invariant mass distribution, each combination 
has a pair of invariant masses so the cuts need to be modified slightly. From the two jet 
combinations we take the pair of invariant masses that have the largest separation between 
them. Placing a cut on the invariant mass we require that one invariant mass is above and 
the other is below the cutoff. We then take the combination with the invariant mass pair 
above the cutoff as the correct combination. Fig. ^(b) shows the scan of this invariant mass 
cutoff. We see that a cutoff of 230 GeV gives the largest sensitivity to this set of events. 
Combined with the results from the first set of cuts we obtain a total event efficiency of 41% 
and a purity of 79%. 

3.3 Uncertainty in particle masses 

Up to this point we have assumed the exact masses of the gluino and the lightest neutralino. 
What if the masses are not known with absolute certainty? To investigate the effects due to 
an uncertainty in mass measurements on the event efficiency and purity in finding the correct 
pairs, we generate events where both the masses of the gluino and neutralino fluctuate by 
10%. Then using exactly the same cuts as before {nijj < 500 GeV and Mt2 < 600 GeV for 
all mass spectra) we compute the event efficiency and purity, as shown in Table ^. 

A 10% increase in the gluino mass in general increases the invariant mass and Mt2 of 
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Figure 3: The sensitivity (solid, black), efBciency (blue, dot-dashed) and purity (red, dashed) as 
functions of the (a) Mt2 and (b) invariant mass cutoff. The cutoffs are used to pick the correct 
combination out of the events where only two combinations passed the original cuts. We see that a 
Mti cut (invariant mass cut) of 460 GeV (230 GeV) produces the largest sensitivity. Combining the 
results of the first set of cuts we have an overall efhciency of 33% and a purity of 84% (41% and a 
purity of 79%) with additional Mt2 cut (invariant mass cut). 



the jet pairs, effectively reducing the probabihty that the jet pair will pass the cuts. In other 
words, some events with all three combinations surviving cuts end up having only one or two 
combinations filtered by this variation. This results in an increasing event efficiency, but the 
contamination lowers the event purity. On the other hand a gluino mass that is 10% smaller 
decreases the invariant mass and of the jet pairs, which increases the probability that a 
jet combination will pass all the cuts. This will lower the event efficiency but will raise the 
event purity due to the fact that in events with only one combination passing all the cuts, 
that combination is more likely to be the correct one. 

The uncertainty in the neutralino mass produces the opposite effect on the event ef- 
ficiency and purity. Because the neutralino is a daughter particle, increasing (decreasing) 
the neutralino mass will increase (decrease) the probability of a jet combination passing our 
cuts producing the opposite effect to the event efficiency and purity than a similar change 
in gluino mass. However, it can be seen that the effect on the event efficiency and purity is 
much milder than that of the gluino due to the smaller change in neutralino mass. Therefore, 
overestimating the mass of the mother particle (or underestimating the mass of daughter 
particle) decreases efficiency and increases purity, while underestimating the mother particle 
mass increases efficiency and decreases purity. 

3.4 Dependence on mass spectrum 

In principle, this method does not require a large mass splitting between the gluino and 
neutralino in order to be effective. To quantify this we scan the gluino and neutralino mass 
over a range of values. Assuming that the gluino and neutralino masses are known, we take 



-9- 





m-0 = 90 

Al 


m-0 = 100 

Al 


m-0 = 110 

Al 


mp=660 

771^=600 

mg=540 


0.31/0.61 
0.21/0.92 
0.13/1.00 


0.30/0.63 
0.20/0.95 
0.12/1.00 


0.28/0.65 
0.19/0.96 
0.11/1.00 



Table 2: The event efRciency (left) and purity (right) with a 10% uncertainty in the gluino and 
neutrahno masses (in GeV). For all mass spectra the same cuts, rrijj < 500 GeV and AIt2 < 600 GeV, 
are applied. 





m^;o = 50 

Al 


m^o = 100 

Al 


m^o = 150 

Al 


m^o = 200 

Xi 


m^o = 250 

Xi 




=700 


0.15/0.94 


0.17/0.94 


0.20/0.93 


0.22/0.93 


0.28/0.93 


mg= 


=600 


0.17/0.95 


0.20/0.94 


0.24/0.93 


0.26/0.93 


0.28/0.92 


m-g-- 


=500 


0.21/0.94 


0.24/0.93 


0.28/0.93 


0.31/0.93 


0.36/0.91 


m-g-- 


=400 


0.25/0.94 


0.30/0.93 


0.34/0.91 


0.37/0.91 


0.43/0.87 


m-g-- 


=300 


0.30/0.93 


0.36/0.92 


0.42/0.90 


0.48/0.85 


0.54/0.80 



Table 3: The event efRciency (left) and purity (right) for several values of the gluino and neutralino 



masses (in GeV). For each mass spectrum the invariant mass and Mt2 cuts are given in Eqs. 3.6 and 



the following cuts on the invariant mass and Mt2'- 



nijj < m-g 

Mt2 < TTT-q 



Al 



(3.6) 
(3.7) 



The results are summarized in Table ^. 

In the increasing nig direction, the invariant mass and Mt2 distributions of the correct 
and incorrect jet combinations becomes more spread apart. In general this would increase 
the chance that a certain jet combination will be cut, reducing the event efficiency. However, 
the events that survive with only one passed combination will more likely be the correct 
combination, leading to a larger event purity. In the increasing m^^o direction, the invariant 

Al 

mass and Mt2 distributions of the correct and incorrect jet combinations becomes more 
compressed, resulting in higher efficiency and lower purity. 

3.5 EfTect of ISR 

Since gluino pair production gives a multi-jet final state, it is important to address an issue 
with isolating ISR jets. The ability to identify ISR is essential for using the discussed proce- 
dure: if the jets from ISR are not correctly isolated then it would be impossible to find the 
correct jet combination. Several different methods have been proposed to identify an ISR 
jet [7-14]. To analyze the gluino pair production with a single ISR jet we concentrate on 
the procedure introduced in Ref. [8]. In order to identify the ISR jet we use the following 
procedure. 

1. The jets with the highest momentum are not considered to be the ISR jet and are 
automatically placed in different gluino decay chains. 
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Number of 


Number of 


Number of 


Number of 


Number of events 


passed 


events 


events 


events 


with correct 


combinations 




with correct 


with Mx2 cut 


combination 






combination 




and Mt2 cut 





10 




17 




1 


454 


419 


580 


533 


2 


773 


754 


1103 


1083 


3 


1176 


1176 


1942 


1942 


total events 


2413 




3642 





Table 4: The breakdown of the number of events that have 0-3 combinations that pass the Mt2 and 
invariant mass cuts. The correct ISR jet has been identified using the two methods explained in the 
text. The events with one correct combination give us an event efficiency of 19% and 16% for the 
events without and with the Mt2 cutoff respectively. Both sets of events have an event purity of 92%. 

2. For the three other jets, each one is taken out and Mt2 is calculated from all the possible 
combinations of the remaining jets. The minimum Mt2 of the possible jet combinations 
is identified as m!^\. 

(i) 

3. The jet associated with the minimum of the is taken to be the ISR jet. 

Using this procedure alone we find that the ISR jet can be correctly identified 24% of 
the time. This number can be improved slightly by putting a minimum cutoff for the 
of min(Mj'2) > 500 GeV. In general, if a correct jet is taken out then the resulting Mt2 
becomes much smaller so this minimum Mt2 cutoff is designed to prevent the small Mt2 
that arises from taking out a hard jet. With this minimum cut included we find that the 
correct ISR jet can be identified 36% of the time. Both of these numbers are consistent with 
the results given in Ref. [8]. If we look at the events that have correctly identified the ISR jet 
and apply the same cuts as before to isolate the correct combination of jets, we can see how 
our procedure fares with the addition of ISR. The results are given in Table ^. We find that 
for the events where the ISR was identified without the Mt2 minimum cutoff, the efficiency 
of finding the correct combination is 19% with a purity of 92%. Including the minimum Mt2 
cut the efficiency is found to be 16% also with a purity of 92%. Though the efficiency drops 
slightly there are more events in this group where the correct ISR jet is identified. It is very 
important to note that these results were obtained from the events where the ISR jet was 
correctly identified. If we were to include the events where the incorrect ISR jet was identified 
then our purity will suffer. This exemplifies the importance of correctly identifying the ISR 
jet in this process. 

4. Application: the b-i ambiguity in the ti dilepton channel 

In this section, we apply the same idea as in Section ^ to resolve the two fold ambiguity that 
arises in tt production in the dilepton channel at the Tevatron and the LHC. Since there 
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are two leptons associated in this analysis, the corresponding background is already small. 
Requiring one or two b-tagged jets makes the background negligible. As an illustration, 
we estimated the cross sections for the signal and SM backgrounds at the 7 TeV LHC. 
We have used mt = 173 GeV. Assuming the cross sections summarized in Table ^, we 
simulate the signal (tt) and background events with PYTHIA [30] and further process them 
with PGS [31] for detector effects. We require two leptons, at least two jets, and missing 
transverse energy. The default cuts in each MC event generator are applied along with 
an extra dilepton invariant mass cut, 40 GeV < mu < 200 GeV. Table ^ summarizes the 
number of signal (S) and background (B) events in each channel for zero, one, and two b- 
tagged jets. The background cross sections are reduced significantly by requiring at least 
two leptons {SO/ BO ~ 3.27) and b-tagging further reduces the backgrounds {Sl/Bl ~ 24 for 
one b-tagged jet and S2/B2 ~ 75 for two b-tagged jets). Although this conclusion with a 
naive estimation of the background cross section may be affected slightly due to systematic 
uncertainties and higher order corrections, we do not expect significantly different results in 
this two-lepton and one or two 5-tagged final state. Therefore we will not address background 
issues in the rest of this section and will assume the background is negligible. 

For the analysis in the rest of this section, we generate lOK tt events using the Mad- 
Graph/MadEvent package at both the Tevatron and the 7 TeV LHC. We force the VF-boson 
to decay leptonically into either an electron or muon. For events simulated at the LHC, 
detector resolution was simulated as in Eq. |2.l| . For the Tevatron, the detector resolution is 
simulated following the parametrization in PGS [31], 

for jets, (4.1) 

for leptons . (4.2) 

We also assume that the ISR is correctly isolated. This can be justified given that we require 
two 6-tagged jets for our study on combinatorial issues. To separate the correct and incorrect 
combinations, we use the same method using the invariant mass of each b-i pairing as well as 
the M'r2 variable. The distributions for the correct and incorrect combinations for Mt2 are 
shown in Fig. ^ for both Tevatron (in (a)) and LHC (in (b)). The correct combinations have 
a cutoff at rritop, while the incorrect combinations have a tail extending beyond this cutoff. 
Fig. m shows the distributions for nihe, the invariant mass of the 6-jet lepton pairings at the 
Tevatron (in (a)) and the LHC (in (b)). Both show a kinematic edge at \ /mfgp — ^ 153 
GeV. 

Specific cut values, Mt2 < 176 GeV and nih£ < 156 GeV, are found by maximizing the 
sensitivity, defined as 5 = e{2P — 1)^. Events are selected if only one of the combinations 
passes these cuts. The efficiency (e) determines the fraction of events passing this criterion 
and the purity (P) is defined as the fraction of lepton-jet pairs that are correctly determined. 
One can optimize these cuts differently, depending on whether higher purity (but lower effi- 
ciency) or higher efficiency (but lower purity) is desired. Table |^ (Table ^) shows the results 
at the LHC (Tevatron) broken down by the number of combinations that pass the invariant 
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Process 


generator 


order 


tinai state 


cross section (pb) 


tt 


THEORY [32] 


NNLL resummation 


ti 


163 


W+ 


FEWZ [33] 


NNLO 


£+1/ + jets 


16670 


w- 


FEWZ 


NNLO 


£~z/ + jets 


11379 




MCFM 


NLO 


+ jets 


3000 


w+w- 


MCFM [34] 


NLO 


inclusive 


43 


W+ + Z/7* 


MCFM 


NLO 


inclusive 


11.8 


W + Z/7* 


MCFM 


NLO 


inclusive 


6.4 


Z/7* + Z/7* 


MCFM 


NLO 


inclusive 


5.9 



Table 5: Cross sections for 7 TeV pp collisions. I includes e and fj,. Z/^ + jets are computed for 40 
GeV < mu < 200 GeV with default cuts in MCFM. 





BO 


SO 


BO+SO 


Bl 


SI 


Bl+Sl 


B2 


S2 


B2 +S2 




608 


990 


1598 


61 


682 


743 


7 


246 


253 




97 


2261 


2358 


6 


1534 


1540 





509 


509 




663 


1220 


1883 


59 


822 


881 


7 


290 


297 


total 


1368 


4471 


5839 


126 


3038 


3164 


14 


1045 


1059 



Table 6: The number of signal (S) and background (B) events in each channel. The numbers (1 and 
2) represent the number of b-tagged jets. Applied cuts are nominal reconstruction cuts at PCS level 
and dilepton invariant mass cut, 40 GeV < mu < 200 GeV. 

mass and Mt2 cuts in each event. When the restriction that only one combination passes 
the cuts is applied, we obtain an efficiency of 51.7% (39.9%) and a purity of 94.9% (91.9%). 

The CDF collaboration uses several methods to correctly pair the 5-jet and the lepton [29]: 
m^""^ , the kinematic method, and the neutrino phi weighting method. Let us explain each 
method briefly. 

m^^^ m,ethod: Since there are two leptons and two 6-jets in each event, there are two 
exclusive pairing options so there are in total four values. One takes the pairing option 
having the maximum m^^ as the incorrect one. As shown in Fig. ^, above a certain value, 
larger m^^ comes often from the incorrect combination. To improve the purity of pairing, 
the events that have maximum m^^ larger than a certain cut value are selected. A larger 
cut value improves the purity at the cost of reduced efficiency. In order to find the best cut 
value (leading to maximum sensitivity, e{2P — V)^) on m^"^, one can scan through various 
maximum invariant mass cut values and select the events having maximum invariant mass 
larger than the cut. The efficiency and purity are calculated for each cut value. 

Kinematic method: the kinematic method is a variation of the method used in the top 
mass analysis in the dilepton channel. This method is based on resolving the full energy- 
momentum conservation equation set for all the particles in the final sates. By a process of 
eliminating variables one is left with a 4th-order polynomial equation. Therefore up to 4 real 
solutions are possible. The CDF collaboration uses a numerical method to solve equations 
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M^(GeV) 



Mr2 (GeV) 



Figure 4: The Mt2 distributions of the 6-£ combinations in the final state of the tt pair production 
at the Tevatron (a) and the LHC (b). They show the clear kinematic edges at mtop- The distribution 
of correct combinations is shown in blue (solid), while the incorrect one is in red (dotted) and all 
combinations are in black (dot-dashed). 




m, , (GeV) m,, (GeV) 



Figure 5: The invariant mass (m^) distributions of the h-i combinations in the final state of the 
tt pair production at the Tevatron (a) and the LHC (b). They show the clear kinematic edges at 

/ 2 2 

and look for up to two solutions. If there are two solutions found, they choose the one having 
smaller m^f. A variation of this method has been used in studies of spin correlation of the 
ti system [35]. The kinematic method has slightly higher efficiency than the m^"^ method 
but its purity is significantly lower [29]. Performance of the neutrino phi weighting method 
is similar to the kinematic method [29]. The matrix element method can be also used but we 
do not find any comments on the pairing purity in the literature. 

As a comparison with our method, we have considered the m^J^^method (denoted as 
{mu-, none) in Figs. ^ and ^, and Tables |9| and |l^) used by the CDF collaboration. Fig. 
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Number of passed 


Number of events 


Number of events 


combinations 




with correct combinations 





248 




1 


5173 


4908 


2 


4575 


4416 



Table 7: Results broken down by number of combinations passing the both the invariant mass and 
Mt2 cuts for events at the 7 TeV LHC. By selecting events with only one event passing, an efficiency 
of 51.7% with a purity of 94.9% is obtained. 



Number of passed 


Number of events 


Number of events 


combinations 




with correct combinations 





230 




1 


3990 


3665 


2 


5778 


5580 



Table 8: The same as Table [7| but at the Tevatron. The corresponding efficiency and purity are 
39.9% and 91.9%. 

|6| shows the purity-efficiency relation with the sensitivity at the Tevatron (in (a)) and the 
LHC (in (b)). This one-dimensional relation is due to a degree of freedom in choosing the 
invariant mass cut. The original CDF method is denoted as (nihi, none) (solid, red) while 
(Mt2) none) (dashed, black) represent the same scheme with rribi replaced by Mt2- At the 
Tevatron when a cut of rribg > 151 GeV is made, the maximum sensitivity is obtained and 
this method results in an efficiency of 37.2% and a purity of 96.3% for top quark production. 
The corresponding cuts for {rrihi, none) are are shown in Table ^. These are comparable to 
results obtained with our proposal. For the LHC, both purity and efficiency are higher (see 
Fig. |(b) and Table 0). 

While the initial requirement of only one combination passing the cuts for our method 
results in reasonable values for efficiency, we can still improve this by making use of the events 
in which both combinations pass the cuts (nearly 50 percent of the events for the LHC). It is 
possible to implement another reconstruction method on these events. One example would 
be to combine our method with the m^'^^method to improve the total number of events in 
which a good combination can be selected, while not causing too large of an adverse effect on 
the purity. In this case, however, we do not find a noticeable improvement over the original 
method. 

Instead, we take a different approach to increase the event efficiency and purity. The 
CDF procedure cuts out events in which the combination with the highest m^^ or Mt2 is 
not over the cut value, thus giving a lower efficiency. We look at those events and apply a 
second cut on the variable not used in the first case, thus applying the method with Mt2 for 
events not passing the nibi selection, {rribi, Mt2), and vice versa, {Mt2, rribtj- We find that 
the Mj'2 and invariant mass together with the CDF scheme lead to much better efficiency 
and purity than those obtained with one of them only. Fig. ^ shows contours of the efficiency 
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Figure 6: The efRciency, purity and sensitivity using the CDF method at the Tevatron (a) and the 
LHC (b). The original CDF method is denoted as (tom, none) (sohd, red) while (Mt2, none) (dashed, 
black) represent the same scheme with niM replaced by Mt2- The contours of constant sensitivity 
are shown in dotted (blue) curves. The dots represent the efficiency and purity that results in the 
maximum sensitivity, and the corresponding cuts are shown in Tables and [Tc|. 




Figure 7: Contours of the efRciency (red, dashed), purity (blue, dotted) and sensitivity (black, solid) 
in the MT2-'mM plane at the Tevatron (a) and the LHC (b). The dots represent the efRciency and 
purity that results in the maximum sensitivity. The corresponding cuts are shown in Tables ^ and 



(red, dashed), purity (blue, dotted) and sensitivity (black, solid) in the Mt2-^m plane at the 
Tevatron (a) and the LHC (b). The cuts that correspond to the maximum sensitivity are 
shown in Table 10. Similar results are obtained for both {mu, Mt2) and {Mt2, mbi) cases, 
and this shows the order of the two cuts is not crucial. With both kinematic cuts, we improve 
the efficiency by 20% while keeping the purity at the same level or slightly higher. 

While in this section we have discussed the combinatorial ambiguity in the b-l pairings 
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cut 


cut value (GeV) 


ernciency (ej 


purity [J-^j 


sensitivity (oj 


(niM, none) 


(151, none) 


0.372 


0.963 


0.319 


[Mt2i none) 


(174, none) 


0.294 


0.972 


0.261 


{niM, Mt2) 


(158, 176) 


0.426 


0.981 


0.398 


(Mt2, rube) 


(175, 155) 


0.442 


0.971 


0.392 



Table 9: Results at the Tevatron for invariant mass cut, Mt2 cut and the hybrid, following CDF 
scheme. The sensitivity is defined as S" = e{2P — 1)^. 



cut 


cut value (GeV) 


efficiency (e) 


purity (P) 


sensitivity {S) 


{mbi, none) 


(152, none) 


0.478 


0.988 


0.456 


[Mt2i none) 


(172, none) 


0.395 


0.981 


0.366 


{mbi, Mt2) 


(152, 174) 


0.574 


0.985 


0.539 


{Mt2, mbi) 


(175, 155) 


0.557 


0.989 


0.533 



Table 10: The same as Table § but for the LHC. 



in the dilepton channel, the same method can be used in principle in the semi-leptonic 
channel as well. In the latter case, often a (or a likelihood function) is defined with 
mass-shell constraints of the top quark and the W boson on the both hadronic and leptonic 
sides, assuming the missing transverse momentum is solely due to the neutrino from the 
W decay. In the procedure, from the on-shell condition of the leptonic W , an additional 
two-fold ambiguity in sign for the z-component (along the beam direction) of the neutrino 
momentum is necessarily introduced. The correct combination and sign are determined by 
minimizing the (or maximizing the likelihood function). The kinematic variables such 
as Mt2 and invariant mass should be able to help to increase the probability of finding the 
correct combination, when they are properly included in the or the likelihood functions. 

We have assumed both 6-jets are tagged but the proposed method here should apply in 
the final state with one 6-tagged jet as well as in the dilepton channel with no 6-tagged jets. 
In these cases, the candidates for the 5-jet can be determined by different selection criteria. 
For instance, in the final state with one 6-tagged jet, the hardest remaining jet could be a 
good candidate for the other 6-jet while for the final state with no 6-tagged jets, the two 
hardest jets are good choices. However the purity of the ti sample in the latter case will 
decrease significantly. 

5. Summary and conclusions 

The Mt2 variable is originally introduced to measure masses of semi-invisibly decaying par- 
ticles. It is the most natural extension to the transverse mass that was used for the W 
discovery and mass measurement. For the last few years, there have been many studies to 
determine masses, spins and couplings. Most of them utilize the kinematics of events with 
missing energy. The Mj'2 variable is especially useful for relatively short decay chains where a 
traditional invariant mass method is limited. The subsystem Mj'2 allows for measurement of 
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all masses in the decay chain, assuming combinatorial issues arc sorted out. It is also shown 
that the Mt2 variable shows an interesting kink structure due to the compositeness of the vis- 
ible sector, the existence of ISR, or upstream transverse momentum from the heavier particle 
decays. It has been extended to include different mother /daughter particles masses. Finally 
it has been proposed as a potential solution for isolating ISR from signals with multiple jets. 

In this paper we have investigated the feasibility of the use of Mt2 to resolve combinato- 
rial issues at hadron colliders. We concentrated on a 4-jets signal where 3 possible partitions 
exist. We have compared the performance of this method with the px versus invariant mass 
method and showed that efficiency increased up to a factor of 5 for the same purity of the 
sample. One of the advantages of the kinematic methods is that there is little dependence 
on the mass spectrum (model dependence), unlike a choice oipT- We have found that our 
results for efficiency and purity remain similar for different choices of mass parameters. We 
showed that the kinematic variable suggested for mass determination is also useful for resolv- 
ing combinatorial issues in signals, as well as for isolating ISR jets from signal jets. 

A similar idea is applied to the ti dilepton system and we found that the obtained 
efficiency and purity are comparable to those using the current CDF method. A variation 
of their method with Mt2 results in improvement of the efficiency by 20% at the Tevatron 
and 25% at the LHC, while the corresponding purity remains the same. It is desirable 
and important for experimental collaborations to revisit the method with a full detector 
simulation. 
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